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ABSTRACT 

Management  science  models  often  require  estimates  of  distances 
between  points  in  a road  network  based  on  the  point  coordinates.  These 
estimates  are  typically  derived  using  rectangular  or  Euclidean  distance  func- 
tions. The  present  investigation  employs  more  general  functions  to  estimate 
samples  of  urban  and  rural  road  distances.  Proofs  of  conditions  for  the 
convexity  of  the  functions  are  given.  Statistical  comparisons  are  made  of 
prediction  accuracy  in  order  to  establish  relative  merit.  Sensitivity  of  the 
best  parameters  of  the  functions  with  respect  to  hfferent  geographical  areas 
suggests  the  importance  of  using  empirical  distance  functions. 
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1.  Introduction 

The  research  described  here  was  c:. fried  out  to  fit  i it!,  -it  ' :1  f \.cl  V 
to  sets  of  urban  and  rural  ro.M  distance  data.  The  objective  of  TV  study  \ 
to  evaluate  distance  predicting  functions.  In  a prevlov  study  (C),  seven 
different  functions  were  Investigated  which  transfer .n  coordinates  of  pof- 
into  csll  .ites  of  road  distances  between  those  points.  The  function  vhi> 
gave  superior  accuracy  was 
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where  q= (q^ , q2)  and  r-O^,  r2>  arc  points  in  two-dimensional  space  and  k, 
p and  s are  parameters.  The  ciost  suitable  value  for  s was  not  markedly 
different  froa  that  of  p when  the  function  war.  "fitted"  tc  tvo  different  set.. 
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of  sampled  inter-city  road  distances.  The  function  d(q,  r;  k,  p,  p)  i.ms  sug- 
gested for  practical  use  since  one  less  parameter  must  be  estimated  fin  the 

data . 


As  many  locational  analyses  are  ncconpl ished  within  an  urban  or  rural 
setting,  it  seems  appropriate  to  investigate  whether  such  distances  can  ' .0 
be  modeled  more  accurately  than  with  the  customary  rectangular  or  Eucliuca, 
distance  functions.  The  data  sets  used  in  this  paper  veie  samples  of  road 
distances  in  a Wisconsin  rural  region  and  in  the  Wisconsin  cities  of  Madison 
and  Milwaukee.  In  addition,  sampled  road  distances  were  included  from  the 
cities  of  Canton,  Columbus,  and  Toledo  in  Ohio.  These  cities  were  chosen  to 
provide  a spectrum  of  road  network  regularity  and  varying  degrees  of  accom- 
modations to  physical  obstruct  ions. 

1'lve  distance  function:,  . re  established  as  models  for  consider." ti.  :: . 


These  are  denoted  by: 

2 

dj^q,  r)  k T.  | q A <0 ) - ri(0)|,  k ^ 0,  where  q^(0)  - q^^  cos  C + sin  0, 

1“1 

q^fO)  " -q,  siri  0 + q^  cos  0 end  similarly  for  r^(0), 
d2(q,  r)  - d(q,  r;  k,  2,  2),  k _>  0, 


<•3(4.  O " d(q,  r;  V,  p,  p),  k _>  0,  p > 1, 


d,(q,  O “ d(q,  r;  k,  p,  s),  k >_  0,  p >.  1,  p >_  s, 


dj(q,  r)  - |(q-r)  M(q-r)]  , where  M - 


mj  ra?/2 
mz/2  tn 


is  a positive  definite 


symmetric  matrix,  (q-r)  is  taken  to  he  a 2 * 1 column  vector, 
and  a prime  denotes  the  transpose  operation. 
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d^  is  the  rectangular  distance  function  with  axes  rotated  through  an  angle  0 
from  the  original  coordinate  axes,  trull  ipl  ied  by  the  constant  I.  d^,  is  the 
Euclidean  metric.  d^  and  d^  replace  the  square  root  and  square  of  the 
Euclidean  metric  by  1 and  2 parameters,  respectively,  d^  provides  for 
weighting  the  squared  coordinate  differences  differently  as  well  as  a 
weighted  cross-product  term  within  the  square  root.  Unit  "circles"  given  by 
the  points  satisfying  d^fq.O)  ■»  1,  f«*l,2,3,  for  k=l  , p >_  1 arc  syi.-m  trie  with 
respect  lo  orthogonal  coordinate  axes.  However,  dt(q,0)  » 1 generates  a :ec 
of  points  q at  unit  distance  from  the  crigin  which  for.  s an  ellipse.  ... 

true  since  d^(q,0)  *=  [q  !!q]  5 c 1 holds  only  if  q Mq  = 1.  The  locus  of  poi. 

, 2 

q such  that  q Mq  ■ 1 forms  an  ellipse  when  tr.^  < 4m^mj  and  this  inequality 
holds  since  M is  positive  definite.  Consequently,  a different  form  of  direc- 
tional bias  in  distances  between  points  is  modeled  using  d^  versus  using  , 
say.  Specifically,  the  directional  bins  inherent  in  d^  is  such  that  the 
travel  direction  of  greatest  ease  is  perpendicular  to  th:t  cf  greatest 
difficulty.  For  these  directions  are  at  angles  to  one  another,  d^ 
lo  invariant  under  rotation  of  the  y.cs  since  the  best  direction  of  axes  is 
chosen  through  the  parameter  idj  in  accordance  with  a best-fit  criterion. 

Except  for  the  Euclidean  distance  function  the  remaining  functions  are  not 
invariant  under  rotation  of  axes;  the  analyst's  choice  of  axes  affects  the 
ability  of  d^,  d^  and  d^  to  accurately  predict  distances.  The  inclusion  of 
0 in  d^(q,r)  allows  for  a properly  oriented  coordinate  system  to  study  the 
ability  of  the  rectangular  distance  function  to  predict  urban  road  distances. 
dj(q,r)  has  but  three  parameters  which  must  be  estimated  since  M is  symmetric 
and  since  any  multiplicative  constant  k can  be  assumed  to  have  been  factored 
into  the  values  of  m^,  m2,  and  n^.  The  Indicated  conditions  on  the  parameters 
are  required  for  convexity.  The  authors  hive  shown  that  d^Cq.r)  is  convex  (3). 
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The  result  '.  In  t lie  appendix  establish  that  d^(>],r)  and  d^(q,r)  are  ..Iso 
convex . 

Empirical  distance  function.':  such  nr.  d^,  d/  and  d^  can  be  used  in  a wide 
range  of  applications.  hour  specific  applications  which  h.ive  core  to  the 
authors’  attention  are  discussed  here. 

1.  Vehicle  Tra ve  1 T imps. 

Kolesar,  Walker,  and  Hausner  (b)  have  formulated  and  validated  a model 
for  predictin'’,  fire  engine  travel  tine  as  follows: 

[ 2(P/a)!i  , if  1)  < 2d  , 

T(D)  =*  < 

|vc/a  + D/Vc,  if  1)  > 2dc, 

where  T it  average  fire  engine  travel  tine,  Ii  is  the  length  of  the  run, 
a *»  acceleration,  dc  » distance  required  to  achieve  cruising  velocity,  and 
Vc  “ cruising  velocity. 

It  would  si,.,  logical  that  tin  model  or  a variation  of  it  could  he  usc-d 
for  many  oth:  r types  of  vehicle  travel  tires;  ve  note  that  the  indtp. i l.nt 
variable  of  1 1:  e trod  el  is  the  travel  distance.  bring  the  empirical  ielh:ds 
we  suggest  h<  . e , given  the  coordinates  of  any  point  in  the  area  to  be  'crvnd 
and  the  coordinates  of  the  service  center,  the  travel  distance  ray  he  generated 
within  the  model.  V.licn  large  numbers  of  points  are  to  be  considered  or  the 
points  are  not  known  in  advance  (i.e.,  may  be  generated  randomly  or  iteratively 
by  the  model),  the  advantages  of  using  an  empirical  distance  function  are 
obvious . 

2 . char  art  o r <7  at  io .:  of_ko  uj_  Networks  . 

The  hcr.t-fit  parameters  of  the  distance  functions  provide  for  reduction 
of  large  amounts  of  distance  data  to  a form  which  makes  possible  certain 
generalizations  about  a road  network.  Consider  d(q,  r;  k,  p,  p) . 11  k and 

p B 1 , the  network  Is  basically  a rectangular  grid.  As  k increases  while 


n-r  t 

Bbl 


a;  j ' " ' 'V  r 
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p " 1 there  remains  a rectangular  bias  such  tbit  travel  In  most  efficient 
parallel  to  the  orthogonal  coordinate  axes  and  least  efficient  at  an;. las 

to  the  axes;  hut  there  is  an  Increasing  nonlinearity  in  the  roadway.  A 
continuin')  of  rectangular  bias  is  measured  along  1 <_  p < 2 . If  p = 2,  the 
network  is  highly  developed  since  efficiency  of  travel  is  not  r.  1 gnif  J cant  l> 
affected  by  directionality.  It  p > 2 the  directional  bias  is  inverted  sine, 
travel  is  most  efficient  at  45°  angles  to  the  axes  and  least  efficient 
parallel  to  the  axes. 

The  implicit  assumption  here  is  that  the  effect;  of  h er.d  p are  sop 
which  nay  not  be  strictly  valid;  c.g.,  k cay  not  be  inter; retable  as  the  reic 
measure  of  "curvature"  separate  and  apart  froa  p.  Similar  analyses  arc 
possible  for  the  remaining  distance  functions  considered  in  this  study. 

3.  Verifying  Distances  in  Road  S’. forks. 

As  Ginsburgh  and  Hansen  (3]  suggest,  it  is  not  uncommon  that  road  nct- 
voik  data  arc  unreliable.  One  of  their  suggested  remedies  is  to  verify  the 
ere  length  between  nodes  i and  J whenever  this  length  does  not  lie  in  the 

<1  A * 

confidence"  interval  [(l-c)d^,  (l-tc)d^].  Here  d is  the  ert i.vatc J di.t.r.cc 

and  c is  heurlstlcally  dctctr.lned.  A transportation  manager  or  urban  planner 

* 

might  screen  road  network  data  using  this  approach  replacing  d^,  by  one  oi  the 
estimating  functions  considered  in  this  study  (if  node  coordinates  can  be 
obtained).  We  have  found  this  to  be  a useful  technique  for  verifying  data  for 
traveling  salesman  and  truck  dispatching  models. 

4.  Facilities  location  Models. 

In  facilities  location  models  the  objective  is  to  locate  private  or 
public  units  such  as  factories,  distribution  centers,  ambulance  bases,  postal 
stations,  etc.,  to  optimize  an  effectiveness  criterion  involving  distances. 
Empirical  distance  functions  can  improve  estimation  of  road  travel  distances 
to  hypothetical  facility  locations  which  nay  be  anywhere  in  the  two-d Iracnsienal 
continuum  or  obviate  the  need  to  individually  measure  myriads  of  distances 
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when  facility  locations  ire  restricted  to  a finite  set  of  candidates.  An 
extensive  t tbl  ii  ■ -raphy  of  location  theory  llteiaturc  is  given  by  Francis 
and  Colei  , ti  in  [2]  and  a literature  review  of  location  in  continuous  sp.ee 
by  Wesol ow..i.y  (9).  Convexity  properties  of  the  distance  functions  nay  he 
important  to  the  user  of  continuous  space  location  models.  It  is  in  response 
to  this  need  that  ve  have  Included  a discussion  of  convexity  properties. 

2.  Design  of  the  Study 

Ohio  city  distances  in  r.iles  were  derived  from  data  supplied  by  the 
Ohio  Department  of  Transportation.  The  data  consisted  of  shortest  path 
distances  between  each  of  a selected  number  of  centroids  in  a given  city. 

The  rectangular  coordinates  in  nilcs  of  each  centroid  were  also  supplied. 

The  selected  centroids  were  chosen  to  give  a "representative"  spread  of 
points  within  the  particular  urban  boundaries.  From  these,  fifteen  points 
were  chosen  randomly  for  each  city.  These  points  were  used  to  generate  a 
sample  of  105  actual  real  distances — one  for  the  distance  between  each  pair 
ol  points,  ’..'isconsia  road  distances  were  calculated  using  published  rev.d 
naps.  An  orthogonal  coordinate  system  was  defined  and  the  fifteen  sample 
generating  points  were  selected  by  randomly  determining  pairs  of  coordinates 
and  discarding  points  which  fell  outside  the  study  area.  Distances  and  point 
coordinates  were  measured  directly  from  the  maps  in  eighths  of  inches.  This 
Introduces  a difference  in  the  scale  of  measurement  between  the  Ohio  and 
Wisconsin  data.  This  scale  difference  explains  the  larger  magnitude  of  values 
of  the  goodness-of- f it  criteria  ADj  and  SD^  in  Tables  1-5  for  Wisconsin  data 
relative  to  Ohio  data. 

Nordbeck  (7)  states  that  a division  of  a city  is  necessary  when  modeling 
road  distances  if  a river  effects  a separation  of  the  road  net.  This  is  true 
since  the  road  net  between  a pair  of  points,  one  on  each  side  of  the  liver, 
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1b  not  homogeneous  when  travel  must  be  routed  via  bridges.  To  explore  tb. 
concern  in  the  study,  a random  sample  of  flftern  of  t tie  centrei  av.:  i lab' 
toi  Toledo  were  chosen  fron  those  to  the  west  of  the  liaur.ee  F.iver  which 
divides  t lie  city.  This  constitutes  the  seventh  set  of  sample  data  which  ..  .1 
be  designated  the  Toledo  Subarea. 

Fitting  Criteria 

As  in  the  previous  study,  two  criteria  of  gcodness-of-f it  v rc  used  : 
measure  t lie  accuracy  of  the  functions.  The  first  is  the  minimization  of 
cuo  of  absolute  deviations  given  by 

AD£  ■=  t |d£(a .,  aL)  - A(a  , a )|, 
j<t  J J 

vhcre  j^l 14;  t=J+l,...,15  and  A(aj,  a£)  Is  the  actual  road  distance 

between  points  aj  and  a£.  This  criterion  requires  estimation  of  greater 
actual  distances  relatively  uorc  accurately  than  shorter  distance..  The 
cecond  criterion  Is  the  minimization  of  a sum  of  squares  given  by 

2 

SI),  » i { [d,  (a. , a ) •-  A(a. , a )]/^'.  (<■..",  u£))  . 

j<t 

Division  by  </ A (a  ^ ) normalizes  the  squared  deviation  and  renders  this 
criterion  more  sensitive  than  the  first  to  large  values  of  |d£(a^,at)  - ^ , *2 1 ) | 

In  relation  to  Afa^,  at).  In  this  way  SD£  offers  an  alternative  notion  of 
goodness-of-f It . 

Parameter  Definition 

The  parameters  of  the  functions  were  chosen  as  those  which  best  fit  the 
given  criterion  for  the  given  sample.  The  "best"  parameter  values  for  each 
sample  depend  on  the  criterion  used.  Computer  programs  were  developed  to 
perform  exhaustive  searches  for  optimal  parameters  within  chosen  intervals. 


>'or  d(q,  r;  >,  p,  p)  t tic  intervals  of  search  were  lf[0.S0,  2.2.c']  and 
pc | 0.90,  2.29].  This  choice  v.is  made  to  allow  for  the  hest-flt  value  cf  e-ch 
parameter  to  fall  outside  the  expected  range  from  1 to  2.  for  d^  the-  intervals 
of  search  t/erc  lc[0.E,  1.25)  and  0c[0°,  90°],  while  for  d-,  the  search  infeival 
vas  he  [0.8,  2.2°].  The  parameter  search  in  the  3-parameter  distance  models 
was  conducted  In  the  following  way,  for  each  of  the  2 criteria. 

first  the  criterion  function  was  evaluated  at  every  grid-point  in  a unit 
cube  in  the  parameter  space,  using  a grid-width  of  0.1.  In  ..Inost  all  c 
the  minimum  value  of  the  criterion  function  was  obtained  at  an  interior  j hit 
of  the  cube.  Where  this  did  not  happen,  the  process  was  repeated  afeer 
shifting  the  cube  in  the  appropriate  direction,  until  an  interior  minimum  on 
the  0.1-grld  was  formed. 

At  this  point  a search  was  conducted  on  the  grid-points  in  a cube  with 
side-length  0.2  centered  on  the  best  point  in  the  coarse  grid,  using  a grid 
width  of  0,01.  When  the  uinii.-.uui  value  of  the  criterion  function  vas  oV.r:.:  •! 
at  an  interior  point  of  the  cu’c,  the  paretic  ter  se_ich  w3s  terminated.  Tills 
happened  with  both  criteria  for  all  geograph  cal  a l ca a , when  d<(  was  c .ployed. 
With  d^,  however,  in  many  cases  the  search  over  the  fine  grid  had  to  be  re- 
peated several  times.  Each  lime  the  cube  was  shifted  in  an  appropriate 
direction  until  an  interior  minimum  was  found. 

The  search  procedure  employed  docs  not  ensure  that  a global  optimum  (to 
2 decimal  points)  was  found.  However,  several  of  the  parameter  values  wore 
verified  by  doing  a total  enumeration  on  a relatively  coarse  grid  (.1).  Va 
feel  confident,  therefore,  that  the  reported  parameter  values  in  most  cases 
are  optimal.  Where  this  is  not  the  case,  the  reported  figures  are  probably 
near-optimal,  at  least  with  respect  to  the  value  of  the  criterion  function. 
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3.  Statistical  Tests 

The  differing  estimating  accuracy  of  the  distance  functions  can  be 
associated  with  statistical  significance  using  the  t-test  for  matched  pairs. 
Define  Xj  (a^  , a^)  ■ | d £ ( a j » a^)  - A(aj,  a t ) | / j , a^)  and  the  difference 
dV  (3j , at)  *>  Xg(aj , a^)  - Xjfoj,  at).  Each  such  difference  is  between  two 
observations  on  the  care  selected  road  distance.  Division  by  /A(a^ , at)  is 
used  to  accomplish  horoscedasticity.  This  was  necessitated  since  the  varia- 
tion in  |d^(aj,  a^)  - A (a ^ , a^)|  tends  to  be  directly  related  to  the  magnitude 
of  Ma j , at).  These  definitions  lead  naturally  to  a test  of  differences 
between  means  based  on  matched  pairs.  This  test,  as  opposed  to  others  that 
might  have  been  used,  Is  unaffected  by  the  lack  of  independence  of  the  error 
in  the  distance  functions.  The  test  statistic  is  t , = dX^r/S , where 

Ef  E dV 

dX  f - £ dX  f(a,,  af)/n,  S is  the  standard  error  of  the  difference 

C j<t  j dXgf 

and  n*’105.  The  necessary  assumption  that  the  population  of  differences  is 
normally  distributed  in  order  that  t^  has  the  t-disti ibulion  with  n-1 
degrees  of  freedom  appears  reasonable  in  this  context  of  estimation  errors. 
Consider  the  first  criterion  of  goodttess-of-f it . If  dg  actually  pro- 
duces values  of  ADg  lower  than  that  for  AD^  then  t £,  which  is  made  up  of 
terms  of  ADg  and  ADj , has  a negative  expectation.  Said  another  way,  if  dg 
is  "more  accurate"  than  d£  then  the  average  value  of  Xg(q,r)  should  be  less 
than  that  for  X^Cq.r).  As  the  computed  value  of  tg£  the  weight  of 

sample  evidence  favors  rejection  of  the  null  hypothesis  E( Xp (q, r) ] -t[X* (q , r ) ] c 0 
and  lends  statistical  significance  to  the  negative  value  of  dXg£.  T°  give 
substance  to  this  notion  a significance  probability  is  reported  as  the  area 
under  the  t-distrlbut  ion  with  104  degrees  of  freedom  in  the  interval  (-■>,  tg£>. 
Although  a t-test  could  be  devised  analogously  to  be  associated  with  the 
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Second  clitcrion  of  goo.hr.  . -of-flt  this  was  not  done  In  eider  to  avoid  uudin 
prol  i f ora  t ion  of  statistical  teste.. 

4.  Presentat ion  and  Analysis  of  Results 
The  results  of  fitting  the  five  distance  estimating  functions  to  the 
seven  sample  rets  of  url.n  and  rural  road  distances  are  summarized  in 
Tables  1-5.  The  statistical  values  associated  with  the  fitting  criteria  are 
given  in  Table  6.  Certain  observations  can  be  made.  The  goodr.ess-cf-f it  ' 
to  the  actual  road  distances  was  superior  to  that  of  d^,  the  other  1 1 . 
parameter  function,  in  every  case  except  for  the  rural  Wisconsin  data.  Statis- 
tical significance  accompanied  this  superiority  in  the  Columbus,  Milwaukee, 
Toledo  and  Toledo  Subarea  samples,  d^  includes  dj , d 2 and  dj  as  special  cases 
and  therefore  estimates  distance  at  least  as  accurately.  This  structural 
superiority  led  to  statistical  significance  at  every  opportunity  except  for 
d^,  and  dj  in  the  Midi  ■ on  end  rur„!  Wisconsin  r. ampler..  The  fitted  par.  far 
o£  d^  do  not  satisfy  convexity  conditions  for  cr.y  but  the  Madison  an  Mi  lvv  u’.  , 
data  sets.  This  result  1 ay  frustrate  the  use  of  d/t  in  facilities  location 
objective  functions  which  are  part  o£  urban  location  models.  A practical 
remedy  would  be  to  fit  t lie  parameters  of  d^  under  the  condition  p >_  s.  The 
simplifying  assumption  that  p * s suggested  by  previous  results  [0)  does  a at 
seem  justified  for  the  Ohio  data.  The  fitted  parameter  values  for  d-j  and  d^ 
satisfy  the  respective  convexity  conditions  in  every  case. 

The  rectangular  distance  function  dj  does  not  fare  well  in  relation  to 
the  alternatives,  in  most  cases  this  inferiority  was  substantiated  by  clear 
statistical  significance.  Indeed,  only  for  the  Milwaukee  sample  did  the 
accuracy  of  d^  surpass  even  that  for  the  Euclidean  function  dj.  The  computa- 
tionally convenient  rectangular  distances  may  therefore  provide  a relatively 
poor  fit  to  a variety  of  urban  data  bases. 
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Tabic  1 


* 


Snnpl  c 

AD 

Minimizing  Parameter r of  dj 

SI) 

i k e 

1 

V. 

{< 

Canton 

123.05 

(136.71) 

1.06(0.95) 

21 

14.11 

(17.86) 

1.03(0.96) 

20 

Col  umbos 

86.07 

(88.35) 

1.02(0.99) 

5 

10.53 

(10.91) 

1.01(1.00) 

5 

Madison 

1015.96(1057.83) 

0.98(0.97) 

4 

157.49(163.42) 

0.98(0.98) 

3 

Milwaukee 

628.95 

(727.1  0) 

0.95(0.95) 

5 

66.86 

(8,7.83) 

0.94(0.94) 

6 

Rural  Vis. 

508.62 

(515.81) 

1.05(1 .05) 

3 

94.95 

(96. 3S) 

1 . 04  ( 1 . ) 

3 

Toledo 

99.73 

(116.83) 

0. 97(0.99) 

15 

14.47 

(18.30) 

0.  98  (C  -) 

12 

Toledo  Sub. 

69.60 

(76.12) 

0.  97(0.96) 

15 

11.42 

(13.98) 

0.97(0.94) 

o n 

* 

Parenthetical 

values  correspond  to 

the  original  axes. 

Table 

2 

Minimi zing 

Tarar 

;e  t e r s o f d ^ 

Snmp) o 

AD? 

k 

SD. 

k 

Canton 

78.39 

1.20 

7.42 

1.22 

Coluubus 

80.20 

1.27 

8.78 

X 

CM 

H 

Mad  iron 

869.40 

1.25 

125.73 

1.25 

Milwaukee 

714.83 

1.16 

72.17 

1.18 

Rural  Vis. 

480.19 

1.35 

72.90 

1.34 

Toledo 

74.13 

1.21 

8.70 

1.24 

Toledo  Sub. 

49.17 

1.21 

7.18 

1.23 

-11- 


Table  3 


* 

Minimizing  Paranctcrs 


Sauple 

ad3 

klP 

sn3 

Canton 

65.62* 

1.16,1.49 

6.03* 



1.18,1.56 

Colunbus 

68. 34 

1.18,1.47 

6.86 

Kf\ 

*sT 

00 

•■H 

Madison 

830.34 

1.16,1.48 

113.24 

1.18,1.56 

* 

. , * 

Milwaukee 

508.81 

1.03,1.30 

46.31 

1.07,1.35 

Rural  Uis. 

452.74* 

1.24,1.45 

70.45 

1.29,1.6 

Toledo 

71.69 

1. IS, 1.74 

8.41* 

1.20,1.73 

Toledo  Sub. 

48.13* 

1.18,1.78 

7.07* 

1.21,1.81 

Entries  correspond  to  original  .axes  or  axes  rotated  through  the  angle  given 
in  Table  1,  whichever  case  yields  the  lower  criterion  value.  An  asterisk 
denotes  that  the  result  is  for  rotated  axes. 


Table  4 


* 


l'inir.i  zing  i’.  meter- 

of  tU 

Satnpl  e 

/J)4 

S,,4 

V. , ;■ . n 

Canton 

60.34* 

1.43,1.66,1.78 

4.10 

1.53,1.81,1.99 

Columbus 

53.76 

1.45,1.38,1.52 

4.39 

1.49,1.4] ,1.57 

Mad  1 ton 

824.20 

1.0) ,1.56,1.51 

113.20 

1.14,1.54,1.53 

Milwaukee 

508.81* 

1.06,1.30,1.30 

46.29* 

1.03,1.34,1.33 

Rural  Wls. 

447.10 

1.56,1.50,1.59 

70.24 

1.37,1.65,1.68 

Toledo 

54.90 

1.56,1.68,1.90 

4.95 

1.56,1.70,1.92 

Toledo  Sub. 

34.08* 

1.55,1.88,2.12 

2.69 

1.64,2.35,2.70 

*Scc  note 

for  Table  3. 

Table  5 


Sample 

ADy 

Mlnlnlzing  Parameters 

“l  . “2  • m3 

of  d ^ 

SD5 

Canton 

63.37 

1.47, 

-0.41,2.02 

5.17 

1.47,-0.29,1.92 

Columbus 

70.99 

1.46, 

-0.99,1.78 

7.52 

1.50,-0.04,1.81 

Hadlson 

855.30 

1.65, 

-0.01,1.46 

120.00 

1.64,  0.00,1.44 

Milwaukee 

657.70 

1.64, 

-0.09,1.27 

62.05 

1.56,-0.11,1.31 

Rural  Vila. 

430.60 

1.82, 

-0.48,1.90 

66.80 

1.8) ,-0.36,1.8 

Toledo 

67.90 

1.52, 

-0.23,1.37 

8.01 

1.57,-0.21,1 . 

Toledo  Sub. 

47.54 

1.50, 

-0.09,1.43 

6.77 

1.70,-0.03,1.44 

Table  6 

Values  of 

dX  c 2nd 

Significance 

Probabilities  (I\m 

entlietically) 

e. 

2 

3 

4 

5 

1 

-.09 

-.13 

-.15 

-.14 

(.00) 

(.00) 

(.00) 

(.00) 

2 

-.03 

-.06 

-.05 

f 

(.00) 

(.00) 

(.00) 

3 

Canton 

-.03 

-.01 

(.00) 

(.18) 

Sample 

4 

B 

.02 

(.87) 

2 

3 

4 

5 

1 

-.01 

-.04 

-.09 

-.03 

(.38) 

(.01) 

(.00) 

(.12) 

2 

-.03 

-.08 

-.02 

f 

(.00) 

(.00) 

(.02) 

3 

Columbus 

-.05 

.01 

Sample 

(.00) 

(.75) 

4 

.06 

(1.00) 
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Table  6 - Continued 


£ 


2 

3 

4 

-.20 

-.23 

-.23 

(.00) 

(.00) 

(.00 

-.03 

-.03 

(.25) 

(.20 

Had ison 

-.00 

Sample 

£ 

(.45! 

2 

3 

4 

.06 

-.05 

-.12 

(.61) 

(.11) 

(.oo: 

-.11 

-.18 

(.00) 

(.oo; 

Miluaul.t  c 

-.07 

Sanp] e 

£ 

(.co: 

2 

3 

4 

-.04 

-.07 

-.07 

(.22) 

(.01) 

(.03: 

-.03 

-.03 

(.09) 

(.09: 

Rural 

.00 

Wisconsin 

(.53: 

Table  6 - Cont inued 


I 


C 


2 

3 

4 

5 

1 

-.07 

(.00) 

-.07 

(.00) 

-.12 

(.00) 

-.06 

(.00) 

2 

-.00 

(.13) 

-.06 

(.00) 

-.01 

(.06) 

3 

Toledo 
Samp) e 

-.06 

(.00) 

-.01 

(.17) 

4 

.05 

(.99) 

e 

2 3 4 5 


1 

-.05  -.06 

-.15 

-.06 

(.00)  (.CO) 

( . 00) 

(.00) 

2 

-.00 

-.10 

-.0) 

(.20) 

(.00) 

(.01) 

3 

Toledo 

-.09 

-.01 

4 

Subarea 

Sample 

(.00) 

(.11) 

.09 

(1.00) 

The  Euclidean  function  which  models  road  distances  as  having  no  directional 
Mas  merge  .)■  preferable  to  dj  on  the  basis  of  accuracy  and  convcnlf  ice. 

Only  the  inflation  factor  U uui.t  be  estimated  and  an  optical  ccordlir  t syst< 
need  not  be  Identified.  If  this  optimality  were  not  Incorporated  Into  d^ 
through  the  angle  0 and  the  original  axes  were  used,  the  dominance  of  do  o vet 
d^  would  be  total.  The  values  of  k were  generally  greater  for  these  sampled 
distances  than  for  the  liter-city  distances  of  the  previous  study. 

The  two-parameter  function  cij  is  not  decidedly  Inferior  to  either  d or 

4 

dj,  each  of  which  requires  the  estimation  of  three  parameters.  The  vain.  of 
p for  dj  even  for  the  cases  of  rotated  axes  do  not  appear  to  be  consistently 
close  to  unity.  This  would  further  substantiate  the  lack  of  a strong  rectan- 
gular bias  In  the  road  networks  of  the  chosen  study  areas.  Addition  of  the 
parameter  0,  ns  In  d^ , to  provide  an  optimal  coordinate  system  could  only 
Improve  the  accuracy  of  dj.  This  was  not  done  here  In  order  to  keep  ihe 
computational  burden  within  tolerable  limits. 

The  'Vuadi.  tlc  forL."  function  dj  proves  to  be  relatively  accurate.  The 
absolute  v.ilu  of  is  typically  close  to  zero.  This  implies  that  th  major 
and  minor  axes  of  the  ellipse  defined  by  dj(q,0)  « 1 arc  generally  close  to 
coinciding  with  the  coordinate  axes  used  in  the  study.  Since  j*  m^,  the 
locus  of  points  satisfying  d^(q,0)  ■ 1 Is  indeed  an  ellipse.  The  parameters 
for  d^  using  rural  distances  provide  the  greatest  deviation  from  these  general- 
izations. Tor  this  sample,  ra^  = Since  nij  “ -0.4R  and  -0.36,  the  major 

axis  of  the  ellipse  points  In  a southwest-northeast  direction.  Additionally, 
d^  best  estimates  the  rural  distances  under  both  criteria.  The  directional 
bias  affotded  by  d^  may  have  special  application  to  such  rural  settings. 

The  rural  distances  sample  leads  to  the  greatest  value  of  k In  each  of 
d^,  d^  and  dj.  This  sample  docs  not  otherwise  lead  to  dlst lngulshably  different 
parameter  values  for  dj  through  d/(  In  relation  to  the  other  sampler).  The  valuer. 
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of  p show  no  tendency  toward  1 or  2 in  dj,  while  the  values  of  p and  s which 
are  slmil..r  for  d^  also  fall  almost  midway  between  1 and  2.  The  Toledo  sub- 
area  sample  allowed  for  lower  values  of  each  AD^  r.nd  Sr>£  than  those  for  t';a 
Toledo  sample  which  includes  some  points  on  opposite  sides  of  the  Maumee 
River.  The  parameter  values  of  the  functions  arc  different  for  the  tvo 
Toledo  samples. 

In  general,  the  parameter  values  for  each  function  were  different  for 
different  samples  and  for  each  criterion  of  goodnass-of -f it . In  the  previous 
6tudy,  parameter  values  were  relatively  the  same  for  each  criterion.  This 
signals  the  importance  of  fitting  an  empirical  distance  function  to  the  par- 
ticular area  of  study  when  a premium  Is  placed  on  accuracy.  There  is  a 
robustness  in  the  relative  accuracy  of  the  functions  but  the  analyst  must 
tailor  the  parameter  values  to  the  study  area  and  according  to  a useful 
criterion  of  goodnecs-of-f it.  In  order  to  Illustrate  how  sensitive  the 
criterion  functions  are  to  changes  in  the  parameter  v..locc,  ve  give  the  fr1- 
louiug  Ls.o  tables  c-f  criterion  values  in  a coarse  grid  cent  ered  at  the  opt  :r  1 
point.  In  both  cases  the  geographical  area  is  Canton,  the  criterion  is 
and  the  grid-width  is  0.1. 

Comparing  the  two  tables,  it  appears  that  for  dj  the  second  criterion  is 
convex  in  the  parameters  ctj,  and  Uj,  and  Is  only  slightly  sensitive  to 

changes  in  them.  For  d^  and  fixed  k the  second  criterion  seems  to  give  rise 
to  a surface  like  a flat-bottomed  valley  with  very  steep  sides.  This  difference 
accounts  for  the  relative  difficulty  in  estimating  the  parameters  of  the  second 
model.  For  this  model,  Tabic  7 also  indicates  that  parameter  values  in  an 
extended  region  may  give  n near-optimal  fit,  as  long  a6  the  proper  balance  is 
maintained  between  p and  s. 
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Tabic  7 


Values  of  SD4  About  k“1.6, 
P”).7,  r.^1.9  where  SD^  “ 4.8 


\ 

k*l . 5 
1.8  1.9 

2.0 

•\ 

k-1.6 
1.8  1.9 

2.0 

X 

k-1 

1.8 

.7 

1.9 

2.0 

1.6  ! 
1 

8 

46 

106 

1.6  | 

1 

4.9 

23. 

71. 

1 

i.6  i 

i 

15 

10 

43 

I 

18 

8 

43 

i.7  i 
1 

47. 

4.8 

20. 

i 

1.7  ! 
1 

94 

15 

8 

1.8  ! 

110 

17 

7 

1 

1.8  | 

187. 

45. 

4.8 

1.8  ! 

288 

90 

15 

Table  8 

Values  of  SD^  About  tee  Optirsjn 
rj'»1.5,  n2”-0-3.  r>j"1.9  where  S9j  - 5.2 


n>j*=1.4  la^l.S  mj*=1.6 


-0.3|  6.5  5.9  5.7  -0.3  | 5.4  5.2  5.5  -0.3!  6.7  7.0  7.5 


0.2  5.8  5.5  5.5  -0.2,'  5.5  5.6  6.0  -0.2  | 7.4  7.9  8.9 
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5.  Conclusions 


Empirical  distance  functions  arc  suggest  iJ  lor  1 r o .•«  J accuiacv  In 
predicting  actual  road  distances  between  two  points  In  an  area  unde  i stub.. 
Parameter  values  cay  be  estimated  upon  defining  a meaningful  g.  Inc: a.-of-l  it 
criterion.  The  standard  assumption  that  urban  distances  are  rectangular  ! 
not  supported  by  the  results  reported  ticre.  The  Euclidean  distance  is  r m 
convenient  to  use  and  appears  to  estimate  urban  distance  core  accurately 
unless  the  road  network  Is  endowed  with  a strong  rectangular  bi  ..  The 
fifth  distance  model  produced  reasonably  accurate  results.  Ho. ever  th.1 
function,  with  its  unique  directional  bias,  would  seem  to  be  especially  li- 
able where  t lie  road  network  is  not  highly  developed--such  as  in  certain  low 
population  density  regions.  In  addition,  t he  parameters  of  d^,  which  r.K 
satisfied  convexity  conditions,  were  more  easily  estimated  than  those  for  d , 
the  other  three-parameter  function,  d^  proved  to  be  most  accurate  f r in 
urban  data  sets.  However,  the  empirically  determined  par. meter  valves  t 
foiled  to  satisfy  convexity  conditions.  A remedy  would  be  to  erfo :<  •„!.  r 

vexity  conditions  in  the  parameter  stimatlon.  gave  relatively  uiu 

results  although  it  was  shown  to  be  inferior  to  the  more  general  d through  tati 

U 

tlcal  analysis.  If  a tiirce  parameter  function  is  to  be  considered,  ints 
the  parameter  0 into  the  definition  of  d-j  to  optimally  orient  the  coorc  tc 
system  would  combine  convexity  and  ease  of  parameter  estimation  with  accuracy.* 
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0024  and  a grant  from  the  Graduate  Scltool  of  the  University  of  Vi  scons  in- 
Mndlson.  James  Morris'  work  was  supported  by  a grant  froi  the  Graduate 
School  of  Kent  State  University.  The  assistance  of  the  Onio  Department  of 
Transportation,  especially  Charles  Cebhardt  and  Charles  Groves,  in  sup  plying 
the  Ohio  data  for  this  study  is  gratefully  acknowledged.  Henrik  Juc)  provided 
helpful  assistance  In  completion  oi  tiie  study.  Dean  L’ichern  provided  h.  1 p • »•  1 
discussions  of  the  statistical  analysis.  A special  note  of  thank:;  Is  extended 
to  the  anonymous  referee  whose  criticisms  ot  an  earlier  draft  Wd  to  substantive 
improvements  in  the  paper. 
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Convexity  Properties 

Proof  of  the  convexity  of  d^(q,r)  when  p 1 • p >_  s,  and  k >_  0: 


It  Is  sufficient  to  prove  that 


1 lqf  ' rJ 

1-1 


1/s 


Is  convex.  It  is 


known  that  d(q,  r;  1,  p,  p)  is  convex.  Let  g(y)  •*  y , y >_  0 and  b j>  1. 
Then  g is  an  Increasing  convex  function  of  y.  Letting  b'-p/s  we  can  write 


l hj  - rj 

1-1 


1/s 


g(d(q,  r;  1,  p,  p))  which  is  convex  for  p 1 , p >_ 


since  an  increasing  convex  function  of  a convex  function  is  convex. 

This  extends  and  generalizes  an  earlier  result  by  Cooper  [1]  that 
2 2 K/2 

<(x,y)  - [ (Xj  - x)  + (y^  - y)  ) is  a convex  function  of  (x,y)  for 

k > 1 and  Bj  ^ 0 since  tiic  above  proof  neither  requires  that  Xj  and  y^ 

be  const." nt^  nor  that  p"-2. 


Froof  of  I tie  convexity  of  d^(q,r)  when  }i  is  positive  definite: 

* • 

Let  (qx,  rx,  q2,  r2)  » (xx,  x2,  Xj,  x/()  ? x and  y = (yx,  y2>  y4) 

Then 

2 2 »< 
djfq.r)  " dj(x)  - [m1(x1  - x2)  + °2^xl  “ *2^x3  “ *4^  + n3^x3  " x4^  1 • 

To  prove  the  convexity  of  d^  we  oust  prove  that  for  any  x and  y and  any  non 
negative  numbers  a,  B,  where  a + B - 1,  the  inequality 

d^(ax  + By)  ad^(x)  + Bd^(y) 

must  hold. 

Since  M is  positive  definite, 
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0 £ (d^ (ax  + By)  )2  - 

a 

xrx2 

+ 8 

yry2 

1 

H 

a 

rx  -X 
X1  x2 

+ 8 

'y,-y2' 

X-j-X/ 

l 3 

y3“y4 

x3">;4 

y3"y4 

- 

J 

- 

- 

’ " 

| 

VX2 

M 

xl'x2 

yl"y2 

yry2 

<_  c 

+ 

B 

M 

X 

u> 

1 

X 

,X3-X4 

>N 

r'l 

?3~y4 

Sec  Hadley  [4,  p.  85], 


Taking  the  square  root  of  both  sides  of  the  second  inequality,  and 


noting  that  (a.  + 


a2) 


<_  ax  + 


>5  . 

a2  v.’hen 


a2  > 0,  we  can  write. 


dj(ax  + Cy) 


a 

xrx2 

« 

M 

fx  -x  'I 
1 2 

‘ 

>S 

+ 

8 

yry2 

I 

H 

yry2 

X3‘X4 

x3-x4 

^ J 

y3~y4 

y3-y4 

< ad5(y.)  + 8d5(y)  , 
which  is  the  desired  result. 
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